A Theory of Reads and Writes for Multi-level Caches

نویسندگان

  • Tyler Sondag
  • Hridesh Rajan
چکیده

A wide range of program analysis and optimization techniques rely on the knowledge of the programs cache behavior. Thus soundness and precision of the underlying cache behavior model is important for these client analyses. Soundness and precision of the cache behavior models are becoming even more important with the recent advent of multi-core and many-core architectures due to two reasons. First, cache sizes are generally small for these architectures, thus precision becomes important. Second large scale sharing of the communication bandwidth between processors and memory potentially creates contention resulting in even larger hit on program’s execution-time due to a cache miss, thus the cost of imprecise analysis is higher. The main contribution of this work is a sound technique for cache behavior analysis that advances the state of the art in three dimensions. First, unlike previous work that only work with single-level caches, our technique is able to handle multi-level caches. Second, unlike previous work which handled either instruction or data caches, our technique handles both. Third, and as a consequence of the first advance, we are able to model a variety of multi-level cache policies. Each of these allow our analysis to more closely model cache behavior in currently available general-purpose multi-core and many-core processors such as the Intel Core 2 Duo, TILE64, etc. The technical device that underlies the theory is the definition of live caches. Live caches model relationships between cache levels and help improve the accuracy of analysis.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Second-tier Cache Management to Support DBMS Workloads

Enterprise Database Management Systems (DBMS) often run on computers with dedicated storage systems. Their data access requests need to go through two tiers of cache, i.e., a database bufferpool and a storage server cache, before reaching the storage media, e.g., disk platters. A tremendous amount of work has been done to improve the performance of the first-tier cache, i.e., the database buffe...

متن کامل

Flash on Rails: Consistent Flash Performance through Redundancy

Modern applications and virtualization require fast and predictable storage. Hard-drives have low and unpredictable performance, while keeping everything in DRAM is still prohibitively expensive or unnecessary in many cases. Solid-state drives offer a balance between performance and cost and are becoming increasingly popular in storage systems, playing the role of large caches and permanent sto...

متن کامل

LAGA: A Software for Landscape Allocation using Genetic Algorithm

In this paper, Landscape Allocation using Genetic Algorithm (LAGA), a spatial multi-objective land use optimization software is introduced. The software helps in searching for optimal land use when multiple objectives such as suitability, area, cohesion and edge density indices are simultaneously involved. LAGA is a flexible and easy to use genetic algorithm-based software for optimizing the sp...

متن کامل

Management policies for non-volatile write caches

Many computer hardware and software architectures buffer data in memory to improve system performance. Volatile disk or file caches are sometimes used to delay the propagation of writes to disk (called delayed writes). While delayed writes improve system performance, volatile caches can cause the loss of vital data during sudden failure. In this study, we investigate managing non-volatile RAM (...

متن کامل

Control Flow Emulation on Tiled SIMD Architectures

Heterogeneous multi-core and streaming architectures such as the GPU, Cell, ClearSpeed, and Imagine processors have better power/ performance ratios and memory bandwidth than traditional architectures. These types of processors are increasingly being used to accelerate compute-intensive applications. Their performance advantage is achieved by using multiple SIMD processor cores but limiting the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009